Pronunciation Modelling and Lexical Adaptation in Mid-size Vocabulary Asr

نویسندگان

Louis ten Bosch

Nick Cremelie

چکیده

A computational-phonological method is presented to automatically adapt the phone transcriptions in a lexicon to improve ASR performance in a number of mid-size recognition tasks. The lexical adaptation approach is based on supervised phoneme loops using cd-HMM segments to find alternatives for the transcriptions, and can be considered as a counterpart of the K-means algorithm but on symbolic level. The word error rate in a limited task (digit string recognition) with dialect speakers is shown to drop by 20-25 percent relative, starting from non-dialect digit transcriptions. Since the method is computationally involving, it is only feasible for relatively small tasks.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

High- and Mid-Frequency Vocabulary Size as Predictors of Iranian University EFL Students’ Speaking Performance

Literature is replete with the studies focusing on the role of vocabulary knowledge in second language receptive skills. However, the relationship between the aspects of vocabulary knowledge and productive skills in general, and the speaking performance in particular has remained scanty in the related literature. This paper examined the relationship between knowledge of L2 vocabulary size at di...

متن کامل

MHATLex: Lexical Resources for Modelling the French Pronunciation

The aim of this paper is to introduce the lexical resources and environment, called MHATLex, and intended for speech and text processing. A particular attention is paid to a pronunciation modelling which can be used in automatic speech processing as well as in phonological/phonetic description of languages. In our paper we will introduce a pronunciation model, the MHAT model (Markovian Harmonic...

متن کامل

Unsupervised topic adaptation for morph-based speech recognition

Topic adaptation in automatic speech recognition (ASR) refers to the adaptation of language model and vocabulary for improved recognition of in-domain speech data. In this work we implement unsupervised topic adaptation for morph-based ASR, to improve recognition of foreign entity names. Based on first-pass ASR hypothesis similar texts are selected from a collection of articles, which are used ...

متن کامل

Data-driven lexical modeling of pronunciation variations for ASR

In this paper a method for the automatic construction of a lexicon with multiple entries per word is described. The basic idea is to transform a reference word transcription by means of stochastic pronunciation rules that can be learned automatically. This approach already proved its potential (Cremelie & Martens, 1999), and is now brought to a much higher level of performance. Relative reducti...

متن کامل

Evaluation of Pronunciation Variants in the ASR Lexicon for Different Speaking Styles

One of the challenges in automatic speech recognition is how to handle pronunciation variation. The main causes for pronunciation variation are the speaker (voice characteristics, accent, non-nativeness etc.) and the speaking style (reading, spontaneous responses, conversation etc.). An ASR system has basically two options for modelling the variation on the word and sub-word level: lexical mode...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2001

Pronunciation Modelling and Lexical Adaptation in Mid-size Vocabulary Asr

نویسندگان

چکیده

منابع مشابه

High- and Mid-Frequency Vocabulary Size as Predictors of Iranian University EFL Students’ Speaking Performance

MHATLex: Lexical Resources for Modelling the French Pronunciation

Unsupervised topic adaptation for morph-based speech recognition

Data-driven lexical modeling of pronunciation variations for ASR

Evaluation of Pronunciation Variants in the ASR Lexicon for Different Speaking Styles

عنوان ژورنال:

اشتراک گذاری